States versus Rewards: Dissociable Neural Prediction Error Signals Underlying Model-Based and Model-Free Reinforcement Learning | Neuron (2010)
Jan Gläscher, Nathaniel D. Daw, Peter Dayan, John P. O'Doherty
DOI: https://doi.org/10.1016/j.neuron.2010.04.016
Reinforcement learning (RL)
Model-free reinforcement learning (model-free RL)
Reward prediction error (RPE)
Ventral striatum
Ventral tegmental area (VTA)
Model-based reinforcement learning (model-based RL)
State prediction error
Prefrontal cortex (PFC)
Intraparietal sulcus (IPS)